Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 2500 |
| Missing cells | 95 |
| Missing cells (%) | 0.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 293.1 KiB |
| Average record size in memory | 120.1 B |
Variable types
| CAT | 8 |
|---|---|
| NUM | 7 |
Interest.Rate has a high cardinality: 275 distinct values | High cardinality |
Debt.To.Income.Ratio has a high cardinality: 1669 distinct values | High cardinality |
Amount.Funded.By.Investors is highly correlated with Amount.Requested | High correlation |
Amount.Requested is highly correlated with Amount.Funded.By.Investors | High correlation |
Employment.Length has 77 (3.1%) missing values | Missing |
Debt.To.Income.Ratio is uniformly distributed | Uniform |
LoanID has unique values | Unique |
Revolving.CREDIT.Balance has 39 (1.6%) zeros | Zeros |
Inquiries.in.the.Last.6.Months has 1249 (50.0%) zeros | Zeros |
Reproduction
| Analysis started | 2020-12-03 09:28:43.264286 |
|---|---|
| Analysis finished | 2020-12-03 09:29:08.497785 |
| Duration | 25.23 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 2500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1250.5 |
|---|---|
| Minimum | 1 |
| Maximum | 2500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 125.95 |
| Q1 | 625.75 |
| median | 1250.5 |
| Q3 | 1875.25 |
| 95-th percentile | 2375.05 |
| Maximum | 2500 |
| Range | 2499 |
| Interquartile range (IQR) | 1249.5 |
Descriptive statistics
| Standard deviation | 721.8321596 |
|---|---|
| Coefficient of variation (CV) | 0.5772348338 |
| Kurtosis | -1.2 |
| Mean | 1250.5 |
| Median Absolute Deviation (MAD) | 625 |
| Skewness | 0 |
| Sum | 3126250 |
| Variance | 521041.6667 |
| Monotocity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 1208 | 1 | < 0.1% | |
| 1222 | 1 | < 0.1% | |
| 1220 | 1 | < 0.1% | |
| 1218 | 1 | < 0.1% | |
| 1216 | 1 | < 0.1% | |
| 1214 | 1 | < 0.1% | |
| 1212 | 1 | < 0.1% | |
| 1210 | 1 | < 0.1% | |
| 1206 | 1 | < 0.1% | |
| Other values (2490) | 2490 | 99.6% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2500 | 1 | < 0.1% | |
| 2499 | 1 | < 0.1% | |
| 2498 | 1 | < 0.1% | |
| 2497 | 1 | < 0.1% | |
| 2496 | 1 | < 0.1% |
| Distinct | 380 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12405.46218 |
|---|---|
| Minimum | 1000 |
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 2867.5 |
| Q1 | 6000 |
| median | 10000 |
| Q3 | 17000 |
| 95-th percentile | 28000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 11000 |
Descriptive statistics
| Standard deviation | 7802.933666 |
|---|---|
| Coefficient of variation (CV) | 0.6289917739 |
| Kurtosis | 0.3073694982 |
| Mean | 12405.46218 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 0.9133255208 |
| Sum | 31001250 |
| Variance | 60885773.8 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 10000 | 206 | 8.2% | |
| 12000 | 151 | 6.0% | |
| 5000 | 110 | 4.4% | |
| 20000 | 107 | 4.3% | |
| 6000 | 103 | 4.1% | |
| 15000 | 97 | 3.9% | |
| 8000 | 90 | 3.6% | |
| 25000 | 65 | 2.6% | |
| 7000 | 54 | 2.2% | |
| 16000 | 53 | 2.1% | |
| Other values (370) | 1463 | 58.5% |
| Value | Count | Frequency (%) | |
| 1000 | 13 | 0.5% | |
| 1125 | 1 | < 0.1% | |
| 1200 | 6 | 0.2% | |
| 1400 | 3 | 0.1% | |
| 1450 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 35000 | 51 | 2.0% | |
| 34500 | 1 | < 0.1% | |
| 33600 | 3 | 0.1% | |
| 33500 | 1 | < 0.1% | |
| 33000 | 1 | < 0.1% |
| Distinct | 710 |
|---|---|
| Distinct (%) | 28.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12002.37419 |
|---|---|
| Minimum | -0.01 |
| Maximum | 35000 |
| Zeros | 4 |
| Zeros (%) | 0.2% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | -0.01 |
|---|---|
| 5-th percentile | 2200 |
| Q1 | 6000 |
| median | 10000 |
| Q3 | 16000 |
| 95-th percentile | 27925 |
| Maximum | 35000 |
| Range | 35000.01 |
| Interquartile range (IQR) | 10000 |
Descriptive statistics
| Standard deviation | 7746.767348 |
|---|---|
| Coefficient of variation (CV) | 0.6454362469 |
| Kurtosis | 0.4144091354 |
| Mean | 12002.37419 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 0.9320839115 |
| Sum | 29993933.09 |
| Variance | 60012404.34 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 10000 | 163 | 6.5% | |
| 12000 | 108 | 4.3% | |
| 5000 | 87 | 3.5% | |
| 6000 | 85 | 3.4% | |
| 8000 | 69 | 2.8% | |
| 15000 | 68 | 2.7% | |
| 20000 | 59 | 2.4% | |
| 7000 | 40 | 1.6% | |
| 4000 | 35 | 1.4% | |
| 16000 | 35 | 1.4% | |
| Other values (700) | 1750 | 70.0% |
| Value | Count | Frequency (%) | |
| -0.01 | 2 | 0.1% | |
| 0 | 4 | 0.2% | |
| 200 | 1 | < 0.1% | |
| 214.02 | 1 | < 0.1% | |
| 224.99 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 35000 | 31 | 1.2% | |
| 34977.35 | 1 | < 0.1% | |
| 34975 | 5 | 0.2% | |
| 34950 | 6 | 0.2% | |
| 34900 | 1 | < 0.1% |
| Distinct | 275 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 2 |
| Missing (%) | 0.1% |
| Memory size | 19.5 KiB |
| 12.12% | 122 |
|---|---|
| 7.90% | 119 |
| 13.11% | 115 |
| 15.31% | 76 |
| 14.09% | 72 |
| Other values (270) |
| Value | Count | Frequency (%) | |
| 12.12% | 122 | 4.9% | |
| 7.90% | 119 | 4.8% | |
| 13.11% | 115 | 4.6% | |
| 15.31% | 76 | 3.0% | |
| 14.09% | 72 | 2.9% | |
| 14.33% | 69 | 2.8% | |
| 8.90% | 64 | 2.6% | |
| 11.14% | 58 | 2.3% | |
| 6.03% | 57 | 2.3% | |
| 17.27% | 56 | 2.2% | |
| Other values (265) | 1690 | 67.6% |
Frequencies of value counts
Unique
| Unique | 69 ? |
|---|---|
| Unique (%) | 2.8% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.7536 |
| Min length | 3 |
Loan.Length
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.5 KiB |
| 36 months | |
|---|---|
| 60 months |
| Value | Count | Frequency (%) | |
| 36 months | 1952 | 78.1% | |
| 60 months | 548 | 21.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Loan.Purpose
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.5 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| other | |
| home_improvement | |
| major_purchase | 101 |
| Other values (9) |
| Value | Count | Frequency (%) | |
| debt_consolidation | 1307 | 52.3% | |
| credit_card | 444 | 17.8% | |
| other | 201 | 8.0% | |
| home_improvement | 152 | 6.1% | |
| major_purchase | 101 | 4.0% | |
| small_business | 87 | 3.5% | |
| car | 50 | 2.0% | |
| wedding | 39 | 1.6% | |
| medical | 30 | 1.2% | |
| moving | 29 | 1.2% | |
| Other values (4) | 60 | 2.4% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 14.3132 |
| Min length | 3 |
| Distinct | 1669 |
|---|---|
| Distinct (%) | 66.8% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 19.5 KiB |
| 0% | 8 |
|---|---|
| 12.54% | 6 |
| 17.95% | 5 |
| 12.20% | 5 |
| 15.60% | 5 |
| Other values (1664) |
| Value | Count | Frequency (%) | |
| 0% | 8 | 0.3% | |
| 12.54% | 6 | 0.2% | |
| 17.95% | 5 | 0.2% | |
| 12.20% | 5 | 0.2% | |
| 15.60% | 5 | 0.2% | |
| 22.74% | 5 | 0.2% | |
| 17% | 5 | 0.2% | |
| 15.88% | 5 | 0.2% | |
| 12.85% | 5 | 0.2% | |
| 16.73% | 5 | 0.2% | |
| Other values (1659) | 2445 | 97.8% |
Frequencies of value counts
Unique
| Unique | 1088 ? |
|---|---|
| Unique (%) | 43.5% |
Histogram of lengths of the category
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.6884 |
| Min length | 2 |
State
Categorical
| Distinct | 46 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 19.5 KiB |
| CA | |
|---|---|
| NY | |
| TX | |
| FL | |
| IL | 101 |
| Other values (41) |
| Value | Count | Frequency (%) | |
| CA | 433 | 17.3% | |
| NY | 255 | 10.2% | |
| TX | 174 | 7.0% | |
| FL | 169 | 6.8% | |
| IL | 101 | 4.0% | |
| GA | 98 | 3.9% | |
| PA | 96 | 3.8% | |
| NJ | 94 | 3.8% | |
| VA | 78 | 3.1% | |
| MA | 73 | 2.9% | |
| Other values (36) | 929 | 37.2% |
Frequencies of value counts
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Home.Ownership
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 19.5 KiB |
| MORTGAGE | |
|---|---|
| RENT | |
| OWN | |
| OTHER | 5 |
| NONE | 1 |
| Value | Count | Frequency (%) | |
| MORTGAGE | 1147 | 45.9% | |
| RENT | 1146 | 45.8% | |
| OWN | 200 | 8.0% | |
| OTHER | 5 | 0.2% | |
| NONE | 1 | < 0.1% | |
| (Missing) | 1 | < 0.1% |
Frequencies of value counts
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.7568 |
| Min length | 3 |
Monthly.Income
Real number (ℝ≥0)
| Distinct | 632 |
|---|---|
| Distinct (%) | 25.3% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5688.931321 |
|---|---|
| Minimum | 588.5 |
| Maximum | 102750 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 588.5 |
|---|---|
| 5-th percentile | 2166.003 |
| Q1 | 3500 |
| median | 5000 |
| Q3 | 6800 |
| 95-th percentile | 11666.703 |
| Maximum | 102750 |
| Range | 102161.5 |
| Interquartile range (IQR) | 3300 |
Descriptive statistics
| Standard deviation | 3963.118185 |
|---|---|
| Coefficient of variation (CV) | 0.6966366725 |
| Kurtosis | 167.4344468 |
| Mean | 5688.931321 |
| Median Absolute Deviation (MAD) | 1666.67 |
| Skewness | 8.467690017 |
| Sum | 14216639.37 |
| Variance | 15706305.75 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5000 | 107 | 4.3% | |
| 4166.67 | 84 | 3.4% | |
| 3333.33 | 71 | 2.8% | |
| 5416.67 | 70 | 2.8% | |
| 5833.33 | 58 | 2.3% | |
| 3750 | 53 | 2.1% | |
| 6666.67 | 52 | 2.1% | |
| 2500 | 51 | 2.0% | |
| 4583.33 | 50 | 2.0% | |
| 6250 | 46 | 1.8% | |
| Other values (622) | 1857 | 74.3% |
| Value | Count | Frequency (%) | |
| 588.5 | 1 | < 0.1% | |
| 666.67 | 1 | < 0.1% | |
| 833.33 | 1 | < 0.1% | |
| 866.67 | 1 | < 0.1% | |
| 884.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 102750 | 1 | < 0.1% | |
| 65000 | 1 | < 0.1% | |
| 39583.33 | 1 | < 0.1% | |
| 27083.33 | 1 | < 0.1% | |
| 25000 | 4 | 0.2% |
FICO.Range
Categorical
| Distinct | 38 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 2 |
| Missing (%) | 0.1% |
| Memory size | 19.5 KiB |
| 670-674 | |
|---|---|
| 675-679 | 166 |
| 680-684 | 157 |
| 695-699 | 153 |
| 665-669 | 145 |
| Other values (33) |
| Value | Count | Frequency (%) | |
| 670-674 | 171 | 6.8% | |
| 675-679 | 166 | 6.6% | |
| 680-684 | 157 | 6.3% | |
| 695-699 | 153 | 6.1% | |
| 665-669 | 145 | 5.8% | |
| 690-694 | 140 | 5.6% | |
| 685-689 | 136 | 5.4% | |
| 705-709 | 134 | 5.4% | |
| 700-704 | 131 | 5.2% | |
| 660-664 | 125 | 5.0% | |
| Other values (28) | 1040 | 41.6% |
Frequencies of value counts
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Histogram of lengths of the category
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9968 |
| Min length | 3 |
Open.CREDIT.Lines
Real number (ℝ≥0)
| Distinct | 29 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 3 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.07288746 |
|---|---|
| Minimum | 2 |
| Maximum | 38 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 7 |
| median | 9 |
| Q3 | 13 |
| 95-th percentile | 18 |
| Maximum | 38 |
| Range | 36 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.507416186 |
|---|---|
| Coefficient of variation (CV) | 0.44748005 |
| Kurtosis | 1.463708626 |
| Mean | 10.07288746 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.8866909421 |
| Sum | 25152 |
| Variance | 20.31680067 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=29)
| Value | Count | Frequency (%) | |
| 8 | 262 | 10.5% | |
| 9 | 237 | 9.5% | |
| 6 | 232 | 9.3% | |
| 7 | 216 | 8.6% | |
| 11 | 187 | 7.5% | |
| 10 | 185 | 7.4% | |
| 13 | 158 | 6.3% | |
| 12 | 153 | 6.1% | |
| 5 | 153 | 6.1% | |
| 14 | 138 | 5.5% | |
| Other values (19) | 576 | 23.0% |
| Value | Count | Frequency (%) | |
| 2 | 24 | 1.0% | |
| 3 | 60 | 2.4% | |
| 4 | 106 | 4.2% | |
| 5 | 153 | 6.1% | |
| 6 | 232 | 9.3% |
| Value | Count | Frequency (%) | |
| 38 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% | |
| 34 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 26 | 3 | 0.1% |
| Distinct | 2349 |
|---|---|
| Distinct (%) | 94.1% |
| Missing | 3 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15223.18462 |
|---|---|
| Minimum | 0 |
| Maximum | 270800 |
| Zeros | 39 |
| Zeros (%) | 1.6% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 916.2 |
| Q1 | 5584 |
| median | 10948 |
| Q3 | 18861 |
| 95-th percentile | 40768.4 |
| Maximum | 270800 |
| Range | 270800 |
| Interquartile range (IQR) | 13277 |
Descriptive statistics
| Standard deviation | 18281.01526 |
|---|---|
| Coefficient of variation (CV) | 1.200866685 |
| Kurtosis | 49.15169313 |
| Mean | 15223.18462 |
| Median Absolute Deviation (MAD) | 6191 |
| Skewness | 5.401569499 |
| Sum | 38012292 |
| Variance | 334195518.9 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 39 | 1.6% | |
| 2174 | 3 | 0.1% | |
| 12588 | 3 | 0.1% | |
| 6055 | 3 | 0.1% | |
| 15055 | 3 | 0.1% | |
| 7161 | 2 | 0.1% | |
| 14268 | 2 | 0.1% | |
| 6969 | 2 | 0.1% | |
| 20694 | 2 | 0.1% | |
| 1442 | 2 | 0.1% | |
| Other values (2339) | 2436 | 97.4% | |
| (Missing) | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 39 | 1.6% | |
| 1 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 270800 | 1 | < 0.1% | |
| 245886 | 1 | < 0.1% | |
| 217827 | 1 | < 0.1% | |
| 216561 | 1 | < 0.1% | |
| 194205 | 1 | < 0.1% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 3 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9066880256 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 1249 |
| Zeros (%) | 50.0% |
| Memory size | 19.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.231149256 |
|---|---|
| Coefficient of variation (CV) | 1.357853221 |
| Kurtosis | 6.545444299 |
| Mean | 0.9066880256 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.042124554 |
| Sum | 2264 |
| Variance | 1.51572849 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 1249 | 50.0% | |
| 1 | 657 | 26.3% | |
| 2 | 336 | 13.4% | |
| 3 | 169 | 6.8% | |
| 4 | 50 | 2.0% | |
| 5 | 14 | 0.6% | |
| 6 | 8 | 0.3% | |
| 7 | 7 | 0.3% | |
| 9 | 5 | 0.2% | |
| 8 | 2 | 0.1% | |
| (Missing) | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1249 | 50.0% | |
| 1 | 657 | 26.3% | |
| 2 | 336 | 13.4% | |
| 3 | 169 | 6.8% | |
| 4 | 50 | 2.0% |
| Value | Count | Frequency (%) | |
| 9 | 5 | 0.2% | |
| 8 | 2 | 0.1% | |
| 7 | 7 | 0.3% | |
| 6 | 8 | 0.3% | |
| 5 | 14 | 0.6% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 77 |
| Missing (%) | 3.1% |
| Memory size | 19.5 KiB |
| 10+ years | |
|---|---|
| < 1 year | |
| 2 years | |
| 3 years | |
| 5 years | |
| Other values (6) |
| Value | Count | Frequency (%) | |
| 10+ years | 653 | 26.1% | |
| < 1 year | 250 | 10.0% | |
| 2 years | 244 | 9.8% | |
| 3 years | 235 | 9.4% | |
| 5 years | 202 | 8.1% | |
| 4 years | 192 | 7.7% | |
| 1 year | 177 | 7.1% | |
| 6 years | 163 | 6.5% | |
| 7 years | 127 | 5.1% | |
| 8 years | 108 | 4.3% | |
| (Missing) | 77 | 3.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.4284 |
| Min length | 3 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| LoanID | Amount.Requested | Amount.Funded.By.Investors | Interest.Rate | Loan.Length | Loan.Purpose | Debt.To.Income.Ratio | State | Home.Ownership | Monthly.Income | FICO.Range | Open.CREDIT.Lines | Revolving.CREDIT.Balance | Inquiries.in.the.Last.6.Months | Employment.Length | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 20000.0 | 20000.0 | 8.90% | 36 months | debt_consolidation | 14.90% | SC | MORTGAGE | 6541.67 | 735-739 | 14.0 | 14272.0 | 2.0 | < 1 year |
| 1 | 2 | 19200.0 | 19200.0 | 12.12% | 36 months | debt_consolidation | 28.36% | TX | MORTGAGE | 4583.33 | 715-719 | 12.0 | 11140.0 | 1.0 | 2 years |
| 2 | 3 | 35000.0 | 35000.0 | 21.98% | 60 months | debt_consolidation | 23.81% | CA | MORTGAGE | 11500.00 | 690-694 | 14.0 | 21977.0 | 1.0 | 2 years |
| 3 | 4 | 10000.0 | 9975.0 | 9.99% | 36 months | debt_consolidation | 14.30% | KS | MORTGAGE | 3833.33 | 695-699 | 10.0 | 9346.0 | 0.0 | 5 years |
| 4 | 5 | 12000.0 | 12000.0 | 11.71% | 36 months | credit_card | 18.78% | NJ | RENT | 3195.00 | 695-699 | 11.0 | 14469.0 | 0.0 | 9 years |
| 5 | 6 | 6000.0 | 6000.0 | 15.31% | 36 months | other | 20.05% | CT | OWN | 4891.67 | 670-674 | 17.0 | 10391.0 | 2.0 | 3 years |
| 6 | 7 | 10000.0 | 10000.0 | 7.90% | 36 months | debt_consolidation | 26.09% | MA | RENT | 2916.67 | 720-724 | 10.0 | 15957.0 | 0.0 | 10+ years |
| 7 | 8 | 33500.0 | 33450.0 | 17.14% | 60 months | credit_card | 14.70% | LA | MORTGAGE | 13863.42 | 705-709 | 12.0 | 27874.0 | 0.0 | 10+ years |
| 8 | 9 | 14675.0 | 14675.0 | 14.33% | 36 months | credit_card | 26.92% | CA | RENT | 3150.00 | 685-689 | 9.0 | 7246.0 | 1.0 | 8 years |
| 9 | 10 | 7000.0 | 7000.0 | 6.91% | 36 months | credit_card | 7.10% | CA | RENT | 5000.00 | 715-719 | 8.0 | 7612.0 | 0.0 | 3 years |
Last rows
| LoanID | Amount.Requested | Amount.Funded.By.Investors | Interest.Rate | Loan.Length | Loan.Purpose | Debt.To.Income.Ratio | State | Home.Ownership | Monthly.Income | FICO.Range | Open.CREDIT.Lines | Revolving.CREDIT.Balance | Inquiries.in.the.Last.6.Months | Employment.Length | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2490 | 2491 | 10000.0 | NaN | 11.71% | 36 months | debt_consolidation | 8.40% | CA | RENT | 4500.00 | 710-714 | 8.0 | 8404.0 | 1.0 | 3 years |
| 2491 | 2492 | 8475.0 | 8475.00 | 7.62% | 36 months | debt_consolidation | 15.88% | CA | RENT | 3983.33 | 720-724 | 9.0 | 6882.0 | NaN | NaN |
| 2492 | 2493 | 6400.0 | 6350.00 | 10.08% | 36 months | debt_consolidation | NaN | NJ | NaN | 5166.67 | 710-714 | 5.0 | 5815.0 | 2.0 | 10+ years |
| 2493 | 2494 | 30000.0 | 30000.00 | 23.28% | 60 months | other | 12.10% | IL | MORTGAGE | 7083.33 | 675-679 | 16.0 | 17969.0 | 1.0 | 10+ years |
| 2494 | 2495 | 24000.0 | 23975.00 | 14.65% | 36 months | debt_consolidation | 15.29% | WA | MORTGAGE | 6666.67 | NaN | 13.0 | 17521.0 | 0.0 | 5 years |
| 2495 | 2496 | 30000.0 | 29950.00 | 16.77% | 60 months | debt_consolidation | 19.23% | NY | MORTGAGE | 9250.00 | 705-709 | 15.0 | 45880.0 | 1.0 | 8 years |
| 2496 | 2497 | 16000.0 | 16000.00 | 14.09% | 60 months | home_improvement | 21.54% | MD | OWN | 8903.25 | 740-744 | 18.0 | 18898.0 | 1.0 | 10+ years |
| 2497 | 2498 | 10000.0 | 10000.00 | 13.99% | 36 months | debt_consolidation | 4.89% | PA | MORTGAGE | 2166.67 | 680-684 | 4.0 | 4544.0 | 0.0 | 10+ years |
| 2498 | 2499 | 6000.0 | 6000.00 | 12.42% | 36 months | major_purchase | 16.66% | NJ | RENT | 3500.00 | 675-679 | 8.0 | 7753.0 | 0.0 | 5 years |
| 2499 | 2500 | 9000.0 | 5242.75 | 13.79% | 36 months | debt_consolidation | 6.76% | NY | RENT | 3875.00 | 670-674 | 7.0 | 7589.0 | 0.0 | 10+ years |